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Abstract 

Mobile P2P technology provides a scalable approach to content delivery to a large number of users on their mobile devices. 

In this work, we study the dissemination of a single content (e.g., an item of news, a song or a video clip) among a population 

of mobile nodes. Each node in the population is either a destination (interested in the content) or a potential relay (not yet 

interested in the content). There is an interest evolution process by which nodes not yet interested in the content (i.e., relays) can 

become interested (i.e., become destinations) on learning about the popularity of the content (i.e., the number of already interested 

nodes). In our work, the interest in the content evolves under the linear threshold model. The content is copied between nodes 

when they make random contact. For this we employ a controlled epidemic spread model. We model the joint evolution of the 

'" "^ copying process and the interest evolution process, and derive the joint fluid limit ordinary differential equations. We then study 

. ^ the selection of the parameters under the content provider's control, for the optimization of various objective functions that aim 

' at maximizing content popularity and efficient content delivery. 

o\ 

C<\ I- Introduction 

^^ Peer-Peer (P2P) architectures help relieve file servers from excessive load by enabling clients to communicate among 

themselves and exchange content, hence providing a scalable approach to content delivery for a large audience. With the 
proliferation of smart phones and high speed mobile Internet, there is increasing download of media content directly into 
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"^ mobile devices. Mobile P2P technology aims to leverage this trend and several possible architectures [1] have been proposed. 

I In this paper we are concerned with peer-to-peer spread of the content under the delay tolerant networking (DTN) paradigm. 

While studying content distribution, it is essential to understand the evolution of demand for the content. For modeling 

demand evolution, we can adopt the point of view that the existing demand for an item influences others also to get interested 

in it. There has been considerable interest in such models in the area of viral marketing research. For example, online social 

ly-v networks (Facebook, Twitter, etc.) provide a platform for the users to exchange their views about a particular media content 

00 with members of their social circle and end up influencing each other Content creators interested in increasing the popularity 

lO of the content use several data mining techniques to identify optimal viral marketing strategies |2| and to identify the most 

[^ influential users in a social network |^. In this paper, we are also concerned with spread of interest in the item of media 

^^ content, and for this we adopt an influence spread model introduced earlier in the context of viral marketing |3|. 

^""^ Evidently, content providers need to adopt their content dissemination strategies according to the prevailing interest in the 

'. , particular content. Though there have been several works that discuss optimal strategies for content distribution in mobile ad 

>■ hoc networks in the DTN setting |l4l,||3 and in mobile P2P H, they do not address the issue of jointly modeling the spread 

of interest in the content and the content itself. In our work, we aim to model the joint evolution of demand and spread of the 

content in a mobile P2P setting. 

d A recent work Q addresses a related problem of demand-aware distribution by employing fluid models for the viral spread 

of demand and aims at obtaining hybrid P2P and client-server architectures that can meet the demand. While |7| assumes 

that the demand spread is uncontrolled and optimizes for content delivery, in this work, we separately discuss optimizing the 

parameters of the demand evolution process (to increase the popularity the content), as well as the content delivery process 

(to efficiently meet the content demand). 

Contributions: Our main contributions are: 

• Developing ordinary differential equation (o.d.e. ) models for the co-evolution of popularity of the content and its spread 
via controlled epidemic copying in a mobile P2P environment 

• Using these o.d.e. approximations to provide insights into the choice of parameters for the content provider in order to 
optimize various system performance objectives 

In Section III] we introduce the combined system model and the related notations. In Section III we analyse a threshold 
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based model for evolution of interest and derive its fluid limit for general threshold distributions. We also discuss the effect 



of threshold distribution on the model, and show that SIR epidemic model [S] is a special case of our model. In Section IV 



we derive fluid limits for evolution of spread on top of the interest evolution model. We finally provide numerical results for 
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Fig. 1. The HILT Network Model 

some practical optimizations (Section IVli associated with the evolution of interest and the joint-evolution of interest and spread 
of the content. 

II. The System Model 

We consider a scenario where the users with their mobile devices, modeled as a population of mobile nodes, meet each 
other according to a random contact model. We assume that the nodes are homogeneous and we wish to model the spread of 
a single content among these nodes. As motivated in the Introduction, we aim to model the joint spread of popularity of the 
content and the content itself. Pertaining to the content, each node has a current state represented by two bits: the want bit 
indicates whether the node is interested in the content and the have bit indicates whether the node has the content. When the 
want bit of a node is 1, we call the node, a destination (i.e., interested in the content), else we call it a relay (i.e., not yet 
interested in the content). 

There is a central server that keeps track of the want state of all the nodes. The total number of destinations (nodes interested 
in the content) is a measure of popularity of the content. In order to promote the content, this measure of popularity must be 
made known to all the nodes. We assume that there is a low bitrate control channel that (i) is used by the nodes to inform 
the central server about their interest in the content, and (ii) is used by the central server to broadcast, at regular intervals, 
the fraction of nodes interested in the content. Thus, for the purpose of spread of content popularity, the network is fully 
connected. 

Relay nodes, on receiving popularity broadcasts from the central server, might get converted to destinations according to 
an influence spread model. We model this influence process (evolution of interest in the content) using the Homogeneous 
Influence Linear Threshold (HILT) model, a special case of the Linear Threshold model (LT) introduced in Kempe et al. 
||3]. For the content copying process, we model the random contacts between pairs of nodes as independent Poisson point 
processes (a model also used in the context of mobile P2P in [9J), with the copying (when pairs of nodes meet) being controlled 
probabilistically, in a manner similar to the Susceptible-Infective (SI) epidemic model ifTOl . 

A. Modeling Interest Evolution: The HILT Model 

In this section we introduce the HILT model used to model the evolution of interest in the content. In the original LT model 
||3|, nodes are part of a weighted directed graph Q = {Af,£), where £ C J\f x Af. With each i,j € £, there is associated a 
weight Wi,j which gives a measure of influence of node i on node j, normalized such that the total weight into any node is 
at most 1, i.e., J^t ^ij ^ 1- The Homogeneous Influence Linear Threshold (HILT model) is a special case of the LT model 
where the network graph is complete and all nodes are homogeneous. Hence, we have a mesh network on the population J\f 
containing N ~ \M\ nodes with each edge carrying the same influence weight 7jv — jy^i ^"d F < 1 (see Figure lb. 

The evolution of interest in the content is modeled by the following influence process evolving in discrete time. Mach node 
j & M independently chooses a random threshold 6j from a given distribution F at the beginning. We begin with an initial 
set of destinations A{Q). In the HILT model, the net influence of a set of destinations on any relay is 7jv times the size of 
the destination set. The number of destinations (currently interested nodes) is broadcast to all the nodes by the central server 
after each period, and a relay gets converted into a destination once the net influence exceeds its chosen threshold. We also 
assume the progressive case, i.e., conversion into destinations is an irreversible process. In other words, a relay j ^ A{k — 1) 
gets influenced in step k if, 

7jv|^(fc-i)|>e, (1) 

The influence is modeled as building up cumulatively over time. The initial set .4(0) is viewed as being infectious and results 
in some of the relay nodes "tipping" over their thresholds and getting converted to destinations in the first period. A node 
j that remains a relay has the level of cumulative influence on it raised to 7Ar|.4(0)|(< Qj). The nodes in .4(0) are now 
viewed as being non-infectious, and the newly infected nodes are denoted by I?(l), with .4(1) — A{0) +I?(1); see Figure [2] 
Thus, at the end of each period the population will contain three types of nodes: A{k), the set of non-infectious destinations. 
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Fig. 2. Spread of Influence in the HILT Model 
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'D{k) C A{k), the set of newly infected destinations in that period (infectious for the next period) and the set of relays denoted 
by S{k). It is clear that the activation process stops at a random step T when there are no more infectious destinations, i.e., 
I?(T) = 0, and a terminal set A{T) is reached. 

B. Modeling Content Copying: SI Model 

In order to model the content delivery process, we further classify the nodes depending on whether they have the content 
(i.e., based on the have bit). Let X{k) C A{k) denote the set of destinations that have the content, and y{k) C S{k) denote 
the set of relays that have the content. 

For the evolution of {X{k),y{k)), we model a content copying process based on the Susceptible-Infective (SI model). 
Between the discrete time steps, pairs of nodes meet at the points of a Poisson process with rate A, and whenever a node 
that has the content meets a node that does not, content transfer takes place in a probabilistic manner At such a meeting, 
we distinguish between the node that does not have the content, being a destination or a relay, by having different copy 
probabilities a^ = ^ and (Tn = j^, respectively. The scaling with respect to A^ is to ensure that the fluid limit infection rates 
would be a and a respectively [10|. 

C. Co-evolution of Interest and Spread: The HILT-SI Model 

In the combined model, underlying the content delivery process (the SI model), the influence process (the HILT model) 
converts relay nodes into destinations. Thus, in our setup, the fraction of destinations is time-varying (unlike |5|). Also, the 
content spread is dependent on the interest evolution but not vice versa. Figure l3] shows the possible transitions between the 
four sets of nodes. An interesting feature of this model is the importance of copying to a relay node. As a content provider, we 
might be interested in delivering only to the destinations (interested in the content). But, there are two advantages of copying 
to a relay. First, copy to a relay promotes the further spread of the content even to destinations; this is the aspect explored in 
a controlled Markov process setting in [5J. Second, the relay we copy to now might later get influenced (by the HILT model) 
to become a destination, which is a unique feature of the HILT-SI model. 

III. Evolution of Interest in the Content 

In this section and in Section V, we will use the convergence of a scaled discrete time Markov chain (DTMC) to a deterministic 
limit described by an ordinary differential equation (o.d.e.) (Kurtz fT\\, and also Darling Iil2l . lfT3l ). to develop deterministic 
(or, so called, fluid) approximations to the HILT model for interest evolution, and the HILT-SI model for co-evolution of 
interest and content spread. 

Consider the HILT model on N nodes, and with edge weights 7jv = jy^y ■ Recall that O, is the influence threshold of User i, 
1 < i < N . The 8^, 1 < i < A^, are non-negative, i.i.d. random variables chosen according to the cumulative distribution 



function F{-), a continuous distribution with density function /(•). We see that F and the threshold distribution F together 
define the HILT model. They can be seen as modeUng the susceptance level of the population to being influenced by the 
popularity of the content under consideration. 

Note that A{k),k > 1 represents the set of destinations and I?(fc) the set of newly added destinations (infectious) at the 
end of period k with I?(0) — A{0). Define B{k) = A{k — 1) , the set of destination nodes up to period k — 1 with B{0) — 0. 
Thus, for fc > 1, B{k) ~ ^Q<i<k-iT^i- We will be working with sets B{k) and I?(fc) to derive the fluid limits of the HILT 
process. Also, since the nodes are homogeneous in the HILT model, it suffices to record the sizes of the respective sets, and 
not on the exact membership of the sets themselves. Let A{k), D{k) and B{k) be the sizes of the sets A{k), ^{k) and B{k) 
respectively. 

A. O.D.E. Model for Interest Evolution 

We can show that the original HILT process {B{k),D{k)) is a Markov chain (see Appendix [a|. In order to obtain an 
approximating o.d.e., we work with an appropriately scaled Markov process {B^ {t),D^ {t)), which evolves on a time scale 
N times faster than that of the original system. We can visualize this process as evolving over "minislots" of duration 1/iV, 
whereas the original process evolves at the epochs k = 0, 1, 2, • • • . The minislots are indexed by t = 0, 1, 2, • • • . Since this 
new process runs on a faster time scale, we slow down its dynamics by utilizing an approach taken in lfT4l . In the present 
context, the scaling approach can be interpreted as follows. In each minislot, each infectious destination in D^ (t) is permitted 
to influence the relays with probability ^ and its influence is deferred with probability 1 — ^- In the former case, it contributes 
its influence of j^zry ^nd then moves to the set B^{t+ 1), otherwise it stays in the D^{t + 1) set. In the original process, the 
influence of all the newly infected destinations will be taken into account by the central server while announcing the popularity 
level of the content in the next time step, whereas, in the scaled process, only those infectious destinations that choose to use 
their influence will be considered. Define by C^(i) C D^ {t) the set of infectious destinations who use their influence at time 
t. Then, 

B^ {t + I) = B^ {t) + C^ {t) 

and by applying Equation [T] to the relay nodes j e N\A{k), 

D^{t + l)^D^{t)-C^{t) 

-F{^M{B^it) + C^it))) - F{j^B^{t)) 
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1 - F(7^B^(i)) 
N - B^{t) - D^{t)] + Z^{t + 1) 



where Z^ (i+1) and Z^ (^+1) are zero mean random variables conditioned on the history of the process [B^ (0) , D^ (0) , • • • B^ [k) , D^ (fc) 
representing the noise terms, and the expectation in the expression for D^ {t + 1) is with respect to C^ (t). Define B^ [t) = 
— j^ and similarly C^{t) and D^(t) to denote the fraction of users of each type (destination and relay). {B^ (t), D^{t)), 
t = 0, 1, 2, • • • is a DTMC on the state space [0, i, f , • • • 1] x [0, i, • • • 1] with B^ (t) + D^ (t) < 1. 

Then we can state the following theorem on the convergence of the scaled process to their deterministic limits. 

Theorem 1: Given the interest evolution Markov process {B'^ (t), D^{t)), with bounded /(•) for the threshold distribution, 
we have for each T > and each e > 0, 



{ sup \\{B^{[Nu\),D^{[Nu\))-{biu),d{u))\\>e 

\0<u<T 
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where {b{u),d{u)) is the unique solution to the o.d.e.. 



b- 



d= -^ — ^——(1-b-d) -d 
1 - F{Tb) ^ ' 

with initial conditions (6(0) = 0,d(0) = a(0)). 

Proof: This is essentially an instance of Kurtz's Theorem fTTl; see also lfT2l . |fT3 l|. In Appendix IE] we derive equivalent 
conditions for verifying Kurtz's theorem. Refer Appendix [B] for the steps involved in deriving the fluid limit. A detailed proof 



verifying the necessary conditions for Kurtz's theorem for the HILT case is provided in Appendix F-A 



Convergence of scaled HILT process to the ODE solution - Illustration of Kurtz's ttieorem 
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Fig. 4. Convergence of the scaled HILT Markov chain (_B^(fc), D^(fc)) to the o.d.e. limit (fe(t), d(t)). The o.d.e. solution (6(t), d{t)) plotted along with 
sample paths of the scaled process {B^{k), D^ {k)) for N = 50, 100, 500, 1000. 

Comparison of ODE approximation (b(t),d(t)) with the original process (B(k),D(k)) 
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Fig. 5. Multiple sample paths of the unsealed HILT process (B(fc), D{k)) for A^ = 3000 is plotted along with the o.d.e. solution {b{t),d{t)) for T = 0.9 
and do = 0.2 

Remark: We know that the hazard function| 15 1 for the cdf F{x) is given by hp{x) = iLpi^\ where f{x) is the corresponding 
probability density function. Hence the o.d.e. becomes, 

i) = d 
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hF{Tb)Td{l ~ b ~ d) -d 



(2) 
(3) 



B. Accuracy of the O.D.E. Approximation 

Consider the HILT process of interest evolution, under the special case of uniform distribution of influence thresholds, i.e., 
t/[0, 1]. The hazard function corresponding to uniform distribution is given by 
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The corresponding o.d.e. s then become: 
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Figure UJ shows the convergence of the scaled HILT process [B'^ {'lNt\), D'^ {\Nt\)) to the solutions of the above o.d.e. 
(6(t), d{t)) for increasing values of N (50,100,500,1000). F and do were chosen to be 0.9 and 0.2 respectively. We see thatfor 
N = 500, 1000 the o.d.e. approximates the scaled HILT Markov chain very well. 

The original HILT process of interest evolution {B{k), D{k)) is then compared with the solutions of the o.d.e. (6(t), d{t)). 
The results are shown in Figure B] where multiple sample paths of the original discrete time HILT process are superimposed 
on the o.d.e. solutions. We find that the o.d.e. solution approximates the original process really well, and hence permits using 
the fluid limit approximation for sufficiently large TV. 

C. Effect of the Threshold distribution 

In the HILT model, while F is indicative of the total level of influence each individual can receive from the others, the 
threshold distribution F() captures the variation among the individuals' susceptance levels for getting interested the content. 
An empirical analysis on the effect of threshold distributions on collective behavior is available in I.16J . Having established 
the o.d.e. limit and studied its ability to approximated the evolution of interest process, in this section we exploit the o.d.e. 
limit to study the effect of the threshold distribution. 

1) Uniform Distribution: For the HILT model of interest evolution under the uniform threshold distribution, we can state 
the following theorem. 

Theorem 2: Given the initial fraction of destinations d^, in an HILT network with influence weight F under uniform 
distribution, define r = 1 — F + Td^. Then, 




0.3 0.4 0.5 0.6 0.7 
Initial fraction of destinations 

Fig. 6. Final fraction of destinations as predicted by the fluid limit boo (bold lines) and by the discrete formulation "'" — 
values of T and do . 

1) The fluid limit for the evolution of interest is given by, 
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2) The final fraction of destinations is given by 
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(dashed lines) plotted for vaiious 
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Proof: The first part of the theorem is obtained by explicitly solving the o.d.e.s for the HILT model under uniform 
distribution (Equations [4] and Bl) for the initial conditions 5(0) = 0,d(0) = (io(see Appendix [g|i. ■ 

Linear Threshold model under uniform distribution has been studied in discrete setting for general networks in |3]. Since 
in the HILT model all nodes are homogeneous, we will be interested in the influence of a set of size k. Consider the HILT 
network with N nodes and influence weight 7jv- Let I^^ik) be the expected size of the terminal set of destinations A{T), 
starting with ^o of size k as the initial set of destinations. By using results from [17], we can show that. 



/^„(fc) = fc[l + {N- fc)7Ar[l + {N-k^ l)7Jv[l + ■ 



In the expression for I^ 
(refer Appendix [Hli 



,{k), noting lim 



iV7Ar = r and do = -^, we can show that as A^ — > oo. 
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This reconfirms the fact that the fluid model is consistent with the discrete formulation. Also, while ifTTll 
allows us to compute only the final fraction of destinations, our current work provide a good approximation of the actual 
trajectories of the influence process. 

Remarks: Figure 6 shows the behaviour of 6oo for various of F and do- Plotted alongside are the values of -^^ — for the 
corresponding values of 7jv and k for N = 3000 and it can be seen that the two solutions match really well. It is easy to note 
that when F = 0, the HILT process does not have any effect on the number of destinations, i.e., the fraction of destinations and 
relays in the population remains a constant, similar to [5|. We observe from the fluid limit that, as long as F < 1 we cannot 
influence the entire population (i.e., 6oo < 1) unless we start off with the entire population to be destinations (i.e., do = 1). But 
if F = 1, then the influence of the destinations are at the maximum, and we can ultimately convert all nodes into destination, 
given we start with a non-zero fraction of destinations, i.e., 6oo = 1 provided do > 0. 

D. Exponential Distribution 

For the uniform distribution the hazard rate function is hp (x) = j^ which is monotonically increasing in x. Such a hazard 
rate implies that the population consists of relay nodes are more susceptible to getting interested in the content as the total 
number of destinations increases (i.e., total accumulated influence over the past increases). The exponential distribution with 
parameter A yields hpix) = A, a constant hazard rate function. This implies a memoryless property for the influence process, 
i.e., the relay nodes are equally likely to get influenced at a given time instant, irrespective of the net accumulated influence 
in the past. We can then state the following theorem. 

Theorem 3: Given the initial fraction of destinations do, in an HILT network with influence weight F under exponential 
distribution with parameter A, 

1) The fluid limit of the evolution of interest is given by the solution to the o.d.e., 

b = d 

d = AFd(l-6-d)-d 

2) The final fraction of destinations is the solution to the transcendental equation, 

fooo = 1 - (1 - do)e-Ar6^ 

Proof: The first part is obtained by substituting hpix) = A in the HILT o.d.e. (Equations [2] and [3]l. This is equivalent to 
the SIR epidemic model with infection rate AF and recovery rate of 1 ifTSl . Note that b{t) is then equivalent to the Recovered 
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Fig. 7. Effect of threshold distribution in the HILT Model. Plotted are the interest evolution trajectories corresponding to Uniform distribution and Exponential 
distribution (A = 1 and A = 2). 

set (R), and d{t) is equivalent to the Infected set (I). The second part follows from classic SIR literature fSl and observing 
that the basic reproduction number (expected number of new infections from a single infection) is AF. ■ 

Remarks: Figure It] shows the comparison of the interest evolution process {b{t),d{t)) with F — 0.9 and do = 0.2, for 
uniform [0, 1] and exponential distribution with parameter A = 1, 2. Observe that, when the expected values of the thresholds 
are the same (uniform [0, 1] and exponential with A = 2), the exponential distribution results in a higher value of boo, since 
there will be a larger fraction of relay nodes with low sampled threshold. Also since AF > ^^^ , d{0) > and in SIR 
terminology [8 1 a proper outbreak is said to have occurred. As A decreases, we see that the expected influence thresholds of 
the relay nodes increase, and hence we notice a significant drop in the interest evolution process. 

IV. Joint Evolution of Interest and Spread of the Content 

In the previous sections we studied the evolution of interest in the content. In this section, we shall adopt similar techniques 
to obtain the joint evolution of interest and spread of the content (HILT-SI model) in mobile P2P setting. Recall that interest 
evolution is modeled by the HILT model and the actual content transfer is modeled by a probabilistic copying process similar 
to the SI epidemic model. While the HILT model evolves independently, the evolution of the SI model depends upon the HILT 
model, since the copying process takes into account the relay/destination state of the receiving node, indicated by the want 
bit of the node. In this section we will derive an o.d.e. limit for the SI part of the joint process. We shall also assume for 
simplicity that the thresholds in the HILT model are uniformly distributed; the analysis can be easily extended to HILT with 
general distribution F. 

Pairwise meetings of the nodes consitute points of a Poisson process with rate A, and let a^r — f] and ittv — ^ be the 
copy probabilities to the destinations (nodes that are interested in the content) and relays (nodes not yet interested in the 
content) respectively. Recall from Section II-C (Figure Isll X{k) the set of destinations that have the content and y{k) the 
set of relays that have the content with X{k) and Y{k) the sizes of these sets respectively. Taking into account the HILT 
driven conversion of relays that have the content into destinations, we note that both X^ and Yk can be written in terms of the 
binomial random variables P(,){k) and Qxy{k), the first due to the content copying process (SI model) and the second due to 
the interest evolution process (HILT model). We can write, 

X(k + 1) = X{k) + Px{k) + Q,y{k) 
Y{k + 1) = Y{k) + Py{k) - Q^y{k) 

Since the nodes meet at rate A over a given slot, the probability that a destination node without the content, receives the content 
is 1 - e->^°'N(x{k)+Y(k))^ Similarly for a relay node, the probability would be 1 - e->''^N(x(k)+Y(k)) _ ^qx lai-ge N, we can 
then write, 



P,{k) ''^'- Bin 



A(k) - X{k),XaN{X{k) + Y{k)) 



Py{k) ^'■^- Bin 



N - A{k) - Y{k),\(TN{X{k) + Y{k)) 



Note also that due to the Interest Evolution process (HILT model) some relays get converted into destinations. Hence nodes 
from yk (relays that have the content) may transition to Xk (destinations that have the content). Since we are working with 
the uniform distribution, the probability of such a transition for each of the relay nodes would be ^Z" miv We denote by 
Qxy{k) the number of relay nodes that get converted during the /cth time slot. Then, 
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Fig. 8. Comparison of the scaled HILT-SI process for N = 50, 100, 1000 with the con'esponding fluid limits. 

Hence we can write the evolution of the processes X(fc) and F(fc) as follows. 

X(fc + 1)-X(fc) = \at^(X{k) + Y{k)){A{k)-X{k)) 

iNDik) 



l-jNB{k) 



Y{k) + Z,{k) 



Y{k + l)-Y{k) = 



\aN{X{k) 
lND{k) 



Y{k)){N - A{k) -Y{k)) 
-Y{k) + Zy{k) 



l-jNB{k)- 

where Zx{k) and Zy(k) are zero mean noise variables respectively for the Xk and Y^ processes respectively. In order to obtain 
the fluid limit, we work with the scaled process which evolves over mini-slots of width -^. In each mini slot, the probability 
that a destination node without the content, receives the content will then be 1 — e i\^(^('=)+^('=)) and a similar scaling 
occurs for the relay case. The interest evolution scaling in the Qxy{k) term is similar to the probabilistic scaling adopted in 
Section III-A We can then write the scaled processes X^ [t) and Y^ {t) and the corresponding fractional processes X^ {t) 
and F^ (t) (see Appendix pi for details) and state the following theorem, along similar lines to Theorem 111 

Theorem 4: Given the joint evolution Markov process [B^ (t) , D^ (t) , X^ {t) ,Y'^ (t)), we have for each T > and each 
e>0, 

'^^{lNu\),D^{lNu\),X''{lNu\),Y^{lNu\)) 



sup 

0<u<T 



B' 



'(b{u),d{u),x{u),y{u))\ 



> e 



N- 







where {b{u),d{u),x{u),y(u)) is the unique solution to the o.d.e., 

d 



b 
d 



Td 



-(1-b-d)- 
Xa{x + y){a — x) + 



Td 



1 



y 



Tb 
Td 

i-r&' 



(9) 
(10) 

(11) 

(12) 



y = Xa{x + y){l - a - y) - 

with initial conditions (6(0) = 0, d(0) = do, a;(0) — xn, y(0) = j/o)- 

Proof: A detailed proof verifying the conditions for Kurtz's theorem to hold, are presented in Appendix F-B ■ 

A. Accuracy of the Fluid limit 

Figure § shows the convergence of the scaled HILT-SI {B^ {[Nt\), D^ {[Nt\), X'^ {[Nt\),Y'^ {[Nt\)) process to the 
solutions of the above o.d.e {b{t),d{t),x{t), y{t)) for increasing values of A^ = 100, 500, 1000. We see that for N = 1000 the 
o.d.e approximates the scaled HILT-SI Markov chain very well. 

The original HILT-SI process {B{k), D(k),X{k), Y{k)) is then compared with the solutions of the o.d.e. by superimposing 
multiple sample paths of the original discrete time HILT-SI process on the o.d.e solutions. (see Figure l9]l. We find that the o.d.e 
solution approximates the original process really well, and permits using the fluid limit approximation for sufficiently large N. 

V. Numerical results 

In this section we discuss the applications of the fluid limits through numerical results for various optimization problems 
of practical interest. While the first part of the section deals with optimizing for the interest evolution process considered in 
isolation, the latter part deals with optimizations for the joint evolution process. As an example, we work with HILT under 
uniform threshold distribution in this section. 




Fig. 9. Comparison of the unsealed HILT-SI process (A^ = 1000) with the coiTesponding fluid limit. 
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Fig. 10. Variation of dj versus T for various values of /3 with F = 0.9 

A. Interest Evolution 

Content creators are often interested in understanding the evolution of popularity of their content, and would wish to maximize 
the level of popularity achieved. This, in our model, is equivalent to the final fraction of destinations (nodes that are interested 
in receiving the content). Once the content is created, in most cases, the content creator does not have control over influence 
weight r or the threshold distribution F of the population. In order to increase the spread of interest in the content, the only 



parameter that can be controlled is (Iq, the initial fraction of destinations in the population. In Section III we derived the 



relation between do and boo, the final fraction of destinations. We might be interested in choosing the right do which can give 
us the required boo, and we see that by rearranging Equation (8]l we get. 



do 



booi r-T) 
booT 



(13) 



Since the o.d.e. provides a good approximation for the temporal evolution of interest, we can also obtain results for time- 
constrained spread of popularity, i.e., the time taken by the process for the spread of influence is also a constraint, in addition 
to the initial fraction of destinations. 

Theorem 5: Given the initial fraction of destinations do in an HILT network with parameter F, the time we have to wait to 
get the final fraction of destinations to be at least (3 (J3 < ^) is given by. 



T(/3,do,r) = -ln 
r 



1 



1 



do 



where r = 1 — F + Fdp- 

Proof: Firstly, note that since Uoo = boo = —, we cannot reach /3 > ^. Since we are observing the process at a finite 
time T, d{T) is not zero. Hence, we consider a{T) = b(T) + d{T) and set it to /3. We get, 

1 = /? (14) 



a{T) = do{- ^ {^- - l)e-^'^^ 
r r 



Rearranging terms, we get the expression for T{(3,do^T). 
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Fig. 11. Variation of dj versus /3 for various values of F at T = 15 




Fig. 12. Numerical simulation of HILT-SI model for two different values of relay copy probability a = \ and cr = 0.2. 

Another interesting question would be to determine the initial fraction do to be chosen so that by time T we will have at 
least /3 fraction of the nodes in the destination set, in the HILT network with parameter F. This can be solved numerically 



using the following fixed point equation obtained from Equation (14 1. 



^-■rT 



\-r 



Let iJ(do) 



-rT 



and Gido) 



We know that d*^ that solves H{do) 



G{do) will He in [^^^ 



1] and that the 



solution is unique, since a{T) is a monotonic function in do. We also know that for do < ^O' H{dQ) > G(do) and for do > dp, 
H{do) < G(do). Under the above conditions, we can use the iterative bisection method that will converge to dg. 

It is clear that. 



The variation of dp with respect to the parameters /?, F and T can be seen in Figures 10 and 
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For fixed F and T, a higher value of /3 requires a higher do (Figure 1 1 1 



If the network has a high F, then it is sufficient to start with a smaller do to reach a given /3 by T. (Figure 111. 

As the fixed time of interest T is increased (the time constraint is relaxed), the required initial do decreases, from /3 at 

T = and finally reaches do obtained by setting boo = (3 in Equation ( 13 1 (Figure [TO)i. 



B. Joint evolution of interest and spread 

In this section we discuss the optimization problems that might be of interest for the joint evolution process. Given that the 
network parameters F and do are fixed, interest in the content evolves independently. The main motive of the content provider 
would then be to ensure that the content is delivered to as many destinations as possible. 

Recall the o.d.e.s for the HILT-SI model given by Equations 9p2 Since we intend to deliver to the destinations, it would 
be optimal to set a — I. Hence the only control parameter is the copy probability to a relay, a. This may be controlled by 
incentivizing or penalizing copying to relay nodes. As noted earlier, copying to a relay has two advantages, since the relay 
might meet a destination in the future, or might itself get converted into one, by the HILT process. But having a high a 
value, will lead to increase in the number of wasted copies, the number of relays (not interested) that will end up having the 
content. The joint evolutions o.d.e.s have been simulated numerically for a — 1 and two different values of a and are plotted 



in Figure 12 As discussed, having a higher value of a accelerates the x{t) process, but leads to higher value of y{t). Hence 
there is a tradeoff between achieving a high x{t) while keeping y{t) under bounds. We discuss two possible optimization 
problems for a that can be posed, keeping in mind this tradeoff between x{t) and y{t). To understand clearly the effect of a 
on the copying process, we work with F = 0.9 and have fixed the initial fraction of destinations do = 0.2, initial fraction of 
destinations that have the content xq = 0.2. This implies that the set of nodes initially interested in the content is equal to the 
set of nodes that have the content. We assume that the content is delivered a priori to these initial destinations by some other 
means. 

1) Maximize target spread: As content providers, we might be interested to deliver the content to as many destinations as 
possible by some fixed time. This might be the case, when the content is time-dependent and its usefulness expires by that fixed 
time. The fraction of destinations is itself increasing with time, due to the HILT interest evolution process. We would also 
wish to ensure that the number of relays that have the content, by that fixed time, is minimal. Our aim is then to maximize 
the value of x{t) at some given fixed r, subject to keeping ^(t) under control. The problem can be formally posed as, 

max x(t) 

{a:y{T)<C} 

Numerically, we can compute the feasible set {a : y{T) < C,} obtain the optimal solution a* that maximizes x{t). We also 
observe experimentally that for a fixed r, x{t) and j/(t) are monotonically increasing with a. 

Figure 13 shows the variation of a* and the corresponding optimal xjr)* for various values of t and a fixed fraction (. 
Observe that, for r small, we can afford a high value of a since y{t) cannot exceed C, in such a short time. Also, when t 
is large, we can have a high a, provided the interest evolution process is strong enough (high F). This will ensure that the 
number of relays s{t) will be low enough by r, and since y{t) < s{t), Vt, y{t) cannot exceed C,. 
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Fig. 13. Maximize target spread: Tlie optimal solution plotted for a fixed value of (^ and varying r. 




Fig. 14. Maximize target spread: The optimal solution plotted for a fixed value of r and varying f . 



Figure 14 shows the similar variations for various fractions ^ and a fixed time r. As (, the constraint on the number of 
relay nodes with the content at r is relaxed, we see that the optimal a* increases. It can be seen that y(r) stabilizes at the 
value dictated by the number of relay nodes s{t) for high value of cr. Also note that, if the fixed time of interest is high, then 
there is negligible contribution due to increase in a since x{t) increases to a{t) eventually irrespective of a (see Figure 12 1. 

2) Minimize Reach time: Another problem of interest is to deliver the content to a fraction of the destinations as early as 
possible. This might be the case when, delivering to a considerable fraction of destinations accrues some benefit for the content 
provider from the content creator, and would like to earn the benefit as early as possible. Hence we try to minimize the time 
taken to reach a particular value of x{t). Define r,; = inf{i : x{t) > ry}. Then we can formally pose this problem as, 

min r„ 

W--V{r„)<C} 

We numerically compute the feasible set by computing r,, for each a and verifying if y(T^) < C- We experimentally observe 
that for fixed rj, r^ is monotonic decreasing in a and y(T^) is monotonic increasing in cr. 



Figures 15 and 16 show the numerically evaluated optimal solutions for the above optimization problem, for various values 



of r] and C- In Figure 15 it is clear that a higher value of ( allows us to use a high value of a so that we can accelerate the 
content delivery to destinations. As a result, we also see that the time taken to deliver the content decreases with increasing (. 
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Fig. 15. Minimize reach time: The optimal solution plotted for a fixed value of r] and varying (. 
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Fig. 16. Minimize reach time: The optimal solution plotted for a fixed value of (^ and varying r\. 



In Figure 16 note that since we begin with xq ~ 0.2, for 77 < 0.2, t^ = and a can be any value. When rj increases, as 
long as r^ is low enough, we can afford to have a high a and still keep y(T^) under control, since the y{t) process may not 
have enough time to reach (. But as ry increases further, a needs to be reduced, since an increased value of r^, implies, the 
process will run longer and 2/(t^) might then exceed (. 

VI. Conclusions 

In this paper we studied the joint evolution of popularity and spread of content in a mobile P2P environment. We used 
two different models to capture the processes of interest evolution (HILT model) and content delivery (SI model), and derive 
their fluid limits. We then showed that the classic SIR epidemic model can be derived as a special case of the HILT model 
and derived explicit solutions for the model when the thresholds were uniformly distributed. We then used the fluid limits 
to address several optimization problems that might be of practical interest. This work can be extended in several possible 
directions. The co-evolution problem can be studied for other related optimization problems, and we can try obtaining explicit 
optimal solutions for a few special cases. One could generalize the problem for analysing the spread of popularity and spread 
for multiple P2P content. And finally one could consider other models for both interest evolution and for the copying process, 
including those where the processes of interest evolution and content delivery are dependent on each other. 
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Appendix A 
Interest Evolution is a IVIarkov Chain 

Proposition 1: For the HILT model, {B{k),D{k)),k > 0, is a discrete time Markov chain (DTIVIC). 
Proof: Since, for each fc > 0, B{k + 1) = B{k) + D{k), it suffices to show that, for fc > 0, 
P{D{k -f 1) = £|(S(0), i?(0)), {B{l).D{l)), ■■■ , {{B{k),Dik)) = {j, m))) 

is a function only of {j,m). We need the following simple lemma (in which the new notation is local to the lemma). 

Lemma 1: Given (Xi, ^2, • • • , X„) i.i.d. random variables with common c.d.f G{x), x > a, where a > 0, and given b > a, 
define 

Z ^ {i:l<i<n,X,>b}, and Z = \Z\. 

Let Yi, I2, ■ ■ ■ ,Yz he random variables jointly distributed as X^, i G Z (ordered in the increasing sequence of indices). Then 

P(r, < y., 1 < z < Z\Z = m) = UZ.^^y'^ ' ^^^^ 



1 - G{b) 



for yi > b,l < i < m. 

Proof: (of Lemma [T]) It is easily seen that 



and 



P{Z = m,yi < y„ • • • , r^ < y^) = ( ^ ) (G(6))"-'" H™ ,(G(2/0 - G{b)) 



P{Z^m)=r\ (G(6))"-"(l-G(6))^ 



from which the desired result immediately follows 



Returning to the proof of Proposition [T] given (B(0), -0(0)), (i?(l), £>(!)), • • • , {{B{k),D{k)) = (j, to)), via a recursive 
application of Lemma [T] (starting from the initial i.i.d. thresholds 8i, 1 < i < iV, with common c.d.f. F{6)), we conclude that 
the thresholds of the users in A/" \ {B{k) U 'D{k)) are i.i.d. with common c.d.f. {Spi -. , over the range 9 > ^j. At the 
end of period k, the newly interested users in I'(fc) will serve as additional influence on the users in Af \ {B{k) U Vik)). Of 
these, £ will become interested (i.e., D{k + 1) = i) with probability 

/(iV-(j+TO))\ / f(7(j+m))-f(7j) y / J^(7(J+H)- J^(7j) V^''-'+""'' 

I t )\ 1-F(7J) ) \ 1-F{13) ) 

which depends on the "history" only via {j,m), and thereby establishes the desired result. ■ 



Recalling from Section 



m-A 



Appendix B 
Fluid Limit for the HILT Model 

the evolution of {B^ (t), D^ (t)) can be written in terms of its mean "drift" at any t as follows. 



D'^ {t + 1 



E 



B^(t + l) = B^(t) 
A^- 1 ~ 



N 



+^r(i+i) 



N 



D'\t) 



F{^nN{B^ (i) + C^ it))) - F{^nN{B^ (i)) 



1 - F{-iNN{BN{t)) 
{l-B'^it)-D^it)) + Z^it + l) 

where we have defined Z^ (t) and Z^ (t) in an analogous manner. The mean drift rate function, g'^ {B'^ [k) , D^ {k)) per unit 
update step size -^ becomes, 

'"<'^"">''^"<">-^f(.)..,f(; 






where 



g'^i-) 



NE 



FijNN{B^it) + C^it))) - FijNNiB^it)) 



1 - F{jNN{B^{t)) 
1 - B'^ (t) - D'^ (t)] -L)^(t) 



Recall jN X (^^1) = T and f{x) the density function of the threshold distribution F. We can then show that (see Appendix [c|, 

'F{jNN{B^it) + (7^(t))) - F(7^iV(B^(t))- 



lim A^E 

N-^ao 



l-F{-fNNim{t)) 



1 - F{Tb) 



Letting gi (6, d) = d and 52 (^, d) 



i-F(rb) (^ ^ b — d) — d and define. 



gib,d) := gi{b,d),g2ib,d) 



We can then use these limiting drift functions to obtain the fluid limits corresponding to the HILT model. 

Appendix C 
Consider 

liiTiN^ooNE 



FijN{B^{t) + C^it))) - Fi-,N{B^{t)) 



1 - F{-tN{B^{t)) 

where the expectation is wifli respect to C^{t) given (i?^(i), D^{t)). Applying Taylor's expansion to F(7iV(i3^(t)+C'^(i))) 
around B^ [t) we get. 






F{-fN{B^{t) + C^{t))) ~ F{-fN{B^{t)) 



l-F(7iV(B^(t)) 
jNC^{t)f{jNm{t)) - 

1 - F(77V(B^(i)) 



In the second term since C^ (t) = ^^ where C^ {t) - Biii{D^{t), ^), we have 



Nu-\2\ 



E{C'\t) 



iV2 



E(C^"(i)^) 



1 

iV2 
1 

iV2 



D^Vt) — (1 



N' 



' N ' 



D'^{m-j^) + iD''{t)r 



Also f{Q is bounded, since we require the drift functions to satisfy the Lipschitz condition. Hence the second term vanishes 
as N ^ oo. In the first term, noting that ^fN -^T and E{C^{t)) = ^^^^^, given B^{t) = b, D^{t) = d, the Hmit becomes 



1-F{rb)- 



Appendix D 
Fluid Limit for the HILT-SI Model 

Adopting the scaling as explained in Section IV we can write the evolution of {X^ {t),Y^ (t)) as follows: 



X^ {t + 1) - X'^ {t) 
= XaN 



{X^{t) + Y^{t))^^j, 



N 



iA^'{t)-X^'it)) 



n"(t) 



l-jNB^it) 






= AcTjv- 



N 



{N~A'\t)-Y'\t)) 



D"{t) 



l-7jvS^(i) 



As earlier, defining X^{t), Y-^{t), A^{t), B^{t), D^ (t) as the fractions of entire population, we can write the mean drift 

functions per unit update size for X^(t) and Y^{t) respectively as, 

g^{x,y,b,d) f .r/ , \f ^ , iNNd 

— ' AaNN[x + y){a — x) + ^^^y 



j_ 

AT 



gy{x,y,b,d) 



N 



1 - jNNb 
Xa^Nix + y){l-a-y)- ^ J^^^^ V 



Let gx{x, y, b, d) = Xa{x + y)(a - x) + jzrnV ^^^ 9y{x, y, b, d) = Xa{x + y)(l 



y) — i^'^b V- Combining these with 



the limiting drift functions of the HILT model (see Appendix |B]l we can derive the fluid limits of the HILT-SI model. 

Appendix E 
Kurtz's Theorem and Proof of the Theorem[T] 

Kurtz's theorem ifTTl provides us a way by which we can, subject to certain conditions, approximate the evolution of a pure 
jump Markov process by the solution of a derived ODE. In this section, we derive the necessary conditions for the convergence 



of the pure jump Markov process to the ODE solution, along the lines of Kurtz theorem, and subsequently check them for the 
HILT model. 

For each iV > 1, F(^^(fc), fc > 0, is a DTMC on A^^^ C A where A is a subset of an appropriate dimensional Euclidean 
space. Let e, , 1 < j < J*-^-*, be the possible jumps of Y''^\-) and p, {y),y € A^^^, the jump probabilities, i.e., 

pW(y) :^ P(r(~)(fc + 1) - r(^^)(fc) ^ ef^lY^^Hk) - y) 

We define an indicator variable, Ij{Y^^''{k)) = 1 if the jump out of Y'-^\k) is e^ . Then we can write 

Y^i^k) = rW(0)+^^ef^/,(rW(z)) 

i=0 j = l 

= rW(o)+x:i:'e5.%r(r(-)w) 

i=0 j = l 

where in the second equality, the second terms comprises the sum of the successive mean jumps, and the last term the sum of 
successive random "noise" terms. We rewrite these compactly by defining, for i > 0, 
Define, for ? > 

j(JV) 

e^W := ^ef\I,iY^^\^))-pf\Y^^\^))), 
j=l 
so that 

1=0 1=0 

where f^^'{y) '■— X]i=i ^j P] iv) is the expected drift out of the state y e A^^'. 
With J"^^' :=cr(yW(0),r(^Hl),--- .y^^H't))' we easily see that 
Lemma 2: Z'^'^\k) :— 'J2i=a C*'^''(*)' ^ > 0, is a martingale with respect to J^l , fc > 0. 
For each i > 0, for each N, we can write 

r(^)([7Vij) (15) 

[Nti-l 

= rW(0)+ E f''''\Y<'''Hi)) + Z<^''\lNt\) (16) 

i=0 

Suppose 

II ■^'^■'(2;) rt Ml N^oc .. ,^„ 

sup II 1 /(a;)|| -> (17) 

with /(a;) being Lipschitz on A, i.e., for all x,y ^ A, for some L > 0, 

||/(a:)-/(2;)||<^||2:-y|| (18) 

Now, in ( fTSJ ), if y^^)(0) — > 2/(0), and the second term goes to 0, the process Y^^\[Nt\),t > 0, appears to be following 

y{t) = y(0) + / f{y{u))du 
Jo 

Take t e [0,T]. For given t, subtracting the ODE solution on both sides of ( [T5| ) 

Y<^^\[Nt\)-y{t) (19) 

= rW(0)-y(0)+ ^ /^(^^''^H^)) (20) 

1=0 

fiyiu))du + Z^^\lNt\) (21) 



We will study each of the three "error" terms on the right hand side of this equation. First, we consider the term 

X: /W(y(^)«)- / f{y{u))du 



[Ntl-l [Ntl-l 

i=0 1=0 

lNt\-l lNt\-l / / . ■ 

i=0 i=0 ^ 



N 



y -/ 



i=0 



TV 



f{y{u))du 



(22) 
(23) 
(24) 
(25) 



Term ( p3| l; Using the uniform convergence hypothesis, for each t, and for an ei > there exists an N large so that, 

[Nti-l [Nti-1 

Y: /(^)(r(^)w)- E ^fiy'''H^)) 



[Ntj-l 



< 



< 



i=0 Af 



i=0 

LJVtj-1 



^ r^ 



lNt\ 

N 



-ei 



Term ( |25] l; By the hypothesis in ( fTT] ), /(•) is Lipschitz on A. The following can then be proved: 

1) ||/(y)|| < 11/(0)11 + L\\y\\ (i.e., at most lineai- growth in ||7;||) 

2) The ODE y{t) — f{y{t)), with initial value 2/(0), has the unique solution y{t) = y{0) + J^, f(y{u))du. Further, over 
< t < T, there are constants such that 

a) \\f{y{t)\\<K,iT){l + \\ym\) 

b) \\yit + r)-yit)\\<K^iT)il + \\ym\)r 



lNt\-l 



vJM.m-1 



f{y{u))du 



i=0 

[Nti-1 

^ E 

i=0 
■t 



< 



N 

lNt\-l 

E 

i=0 

[Ntj-l 



^Ani^ 



f{y{u))du 
1 



f{y{u))du 



2N 



f[y 



i + l 
N 



-fly 



N 



c'AT) 



N 



^ E c^iT)^+c[iT)j^ 
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where the penultimate inequality follows from the Lipschitz property of /(•). 
Term ( |24] i.- By using the Lipschitz property of /(•) 

VNt\-l VNt\-l 

E W^^)~ E Uy" 



i=0 



N 



< 



[Ntj-l 

E 



N 



(rW(z))-y 



N 



We now work on the left hand side of ([T9| 

rW(LA^iJ)-y(i) 



> 



Y^^'Him) - y (^) - [yit) - y y ^ 



im 



Y^''Hm)^y(^^~^\\-\\(yit)^y 



N 



and it can be shown that, with y{t) = y{0) + /g f{y{u))du, for t e [0,r], 



We now use the hypothesis that Y^^^O) A- y{0) 

Then, for large enough N, combining ( |23| l, ( p5] l, ( p4] i, ( |26]l , ( |27 



< C4(r) 



1 

iV 



I with ([T9]l for all t G [0, T], we can write 



TV 



[Ntj-l 



s '+ s ^^ 



i=0 



(>^^^HO)-y(^) 



zW([A^tJ) 



We now work on the "noise" martingale, using Burkholder Inequality(BI) for zero-mean martingales. 



E I sup 

vO<t<T 



Z'^^\lNt\) 



( 



sup 

0<t<T 






i=0 



[WTJ-l 



[NT\ 



< CiCo- 



iV2 



Where we have required that E IIC^^H*)!! — coj^j, for all i. 
It follows that 

Pf sup Z^^\lNt\) 

\Q<t<T 



< £ ^1 



(26) 



(27) 



(28) 



Theorem 6: (Gronwall's Inequality) Given a positive sequence a„,n > 0, and b,c > 0, if, a;„,n > 0, is a nonnegative 
sequence such that, for each fc > 1 



fc-i 



Xk 



< b + C^^ O-iXi 



then, for each fc > 1 



Xk 






From 



28 



on a sample path where supQ<«y ||Z(^)([iVtJ)|| < e and ||y^(0) — j/(0)|| < epsilon, we can write, for 



o< fc <T7vrj, 



Y^''Hk)-y{^) 



fe-i 



^ 3.+E^^ (>^<")«)-2/(^) 



i=0 



Hence, by Gronwall's inequality, for any e > 0, 

P( sup ||zW(LA^tJ)|l< 
o<t<r 



< 



sup 

^0<k<[NT] 



Y^^\k)-y{^) 



LlNTj 

< 2ee^^ 



By the earlier assertion, the left hand term ^ 1 as A^ — > cx), hence so does the right hand term. Hence we can state the 
following theorem. 

Theorem 7: With the notation defined earlier, if, for all fc > 0, 



/ 



j(") 






\n < Co 



1 



Further, supj,g^(jv) j^^"'-^ — f{x) "— >°° Q with f{x) being Lipschitz (i.e., for all x,y € A, for some L > 0, \\f{x) — f{y)\\ < 

i ||x - y||). Define 2/(t)"o < f < T, by y(f) = 2/(0) + /„ /(y(7i))dw. Also, r(^)(0) 4 y(0). 
Then, for each T > 0, and each e > 0, 



P I sup 

vO<i<T 



yW(Lmj)-y(i) 



> e ^ 



Appendix F 
Application of Kurtz's theorem 

A. Verification of Kurtz's theorem for HILT model 

Now we can check each of the above conditions in Theorem |7] for the HILT model, 
(i) Lipschitz property 

Consider, 

gi{b,d) =d 

g2ib, d) = hF{Tb)rd{l ~^b-d)-d 



dgi 
db 



' dd 



%2 

db 



T^dhF{Tb){l -b-d)~ TdhpiTb) 



%2 

dd 



hf{Tb)T{l-b-2d) -1 



Under the assumption that /(•) is bounded, we see that each of the terms above is bounded when (6, d) e [0, 1] x [0, 1 — 6]. 
Thus the norm of Jacobian \\Dg{b, d)\\ is uniformly bounded, and it follows that g{b, d) ~ {gi{b, d),g2{b, d)) is Lipschitz. 
(ii) Uniform Convergence 
We know that 



1 



g'^B'^k),D'^k)):=-[D'^k),NE 



F{jNN{B^it) + C^{t))) - F{jNNiB^{t)) 



l-F{jNN{BN{t)) 



1-B^(t)-D^(t) -D'^it) 



Let gi{b, d) ^ d and g2{b, d) = jzrpuYb)^^ — b — d) — d and define, 



g{b,d) := gi{b,d),g2ib,d) 



C 



the uniform convergence of ^ -^^/j^ to g{b, d) in the 



By definition, lim^v^oo ^7 = T and by the steps in Appendix 
domain {b,d) £ [0, ;^, • • • ,N] x [0, ;^, • • • ,6] is proven, 
(iii) Bounded Noise variance We will write the noise variance terms for the scaled processes B^ {k) and D^ (k), and from 
them derive the noise variance terms for the density dependent processes by scaling them by ■^. 

Let Z^ (k) and Z^ (k) be the zero mean random variables representing the noise in the drift terms of B^ (k) and D^ [k] 
respectively. We can then write, 

E[\Z^{k)\''\F,] = {l-^)^D^{k) 



1 . 1 



z=D^{k) 



E[\Znk)\^\F,] = {l--)-D^{k)+ Y. g{z){l^g{z)){N^B^{k)-D^{k)) 



z=0 



z IN ^ n' 



where, 

_ F{-fN{B^{k) + z)) - Fi-yNB'^ik)) 

^^^' " l~F{-fNBN{k)) 

Note that both these terms can be upper bounded by constants say c^ and Cd and hence the noise conditions for B^ [k] 

and D^ [k] are satisfied. 
(iv) Convergence of initial conditions By choice, we have B^ {Q) ~ 6(0) and l)^(O) = cf(0). 
Thus by Theorem [7] we have for each T > and each e > 0, 

p( sup ||(B^(LiVij),^^(L7vtj))-(6(i),d(t))||>6V4°°o 

\0<t<T / 

where (6(t), d{t)) is the unique solution of the ODE 

b{t) = d{t) 
d{t) = -dit) + ^^^(1 - b{t) d{t)) 

with initial conditions (6(0) = 0,d{0) = a(0)). 

■ 

B. Verification of Kurtz theorem for HILT-SI model 

The first two equations, as proved above will remain unaltered. Consider the drift equations for X^ {k) and Y^ [k) and the 
corresponding ODEs. We can use a similar procedure as above to verify the conditions for applying Kurtz's theorem, 
(i) Lipschitz property 
Consider, 

9x{x,y,b,d) = Xa{x + y){a-x) + _^ _ -^^ y 

gy{x,y,b,d) = Xa{x + y){l - a - y) - ^3^6^ 

where a = b + d. We can calculate the partial derivatives of g^. and gy with respect to x, y, 6, d and note that given 
6 + (i<l, a::<6 + (i, 2/<l — 6 — d and all quantities non-negative, we have that the Jacobian is uniformly bounded and 
hence the drift functions are Lipschitz. 
(ii) Uniform Convergence We know that, 

g^{x, y, 6, d) = — f XaNN{x + y){a-x)+ ^^ ^^ y 



yy {x,y,b,d) = —lXaNN{x + y){l-a-y)- ^_ ^^ y 

Define, 

9x{x,y,b,d) = Xa{x + y){a-x) + ^_j., y 

gy{x,y,b,d) = Xa{x + y){l - a - y) - ^ _ -^^ y 

Since as iV — ^ oo, we have 7ArA^ -^ F, a^N — > a, gnN —> a and we see that the uniform convergence is straightforward, 
(iii) Bounded Noise variance As earlier we will write the noise variance terms for X^ (h) and y^(fc) and derive the density 
dependent process' noise variance by scaling by ^. 

Let Z^ (k) and Z^ (k) be the zero mean random variables representing the noise in the drift terms of X^ {k) and Y^ [k] 
respectively. We can then write. 



D"(fc) 



7^1 (. 7^1 \^Nn.^(D''{k)\l\^ 1 



^o-"wiV.i - Er3^i^('-r3^i^j>-<K =; J^.<'-^'°""' 



21^0 

X^ ik)-\-Y^ (k) 



-ii-."wi'i-d ^ E r^^l^ O-T^^i^) -"'<"?) ^>-^)''"'"- 



E ^(i-^x^-^-nir'^t" ''')^>-^)^"-"^'""- 



zi=0 

Z2=0 \ ^ / 

We see that the above two terms can be bound by constants c^. and Cy respectively and we can see that the noise conditions 
are satisfied, 
(iv) Convergence of initial conditions By choice, we have X^{0) = x{0) and Y^{0) — y(0). 
Thus by Theorem l7j we have for each T > and each e > 0, 

p( sup \\{B''{lNt\),D''{[Nt\),X''{lNt\),Y''{lNt\)) - {b{t),d{t),x{t),y{t))\\> e) "^4°° 

\0<t<T J 

where {b{t),d{t),x{t),y{t)) is the unique solution of the ODE 

b = d 
d^^^^il-b-d)-d 

X = \a{x + y)[a - x) + ^ _^^ y 

y = \(T{x + y){l -a-y) 



l-Tb- 
with initial conditions (6(0) — 0, d(0) = do,x(0) — XQ,y{Q) = 0). 



Given the system of ODEs 



Dividing d by h. 



By separating the variables, 



Appendix G 
Solution of the ODE 



b = d 
d = -d+^-^(l-6-d) 



dd Tb~b~ r62 



dfc d(l - Tb) 



Ad db 



T -1-Td l-Vb 
Integrating on both sides, and after taking anti-logarithm 

r- 1 -rd = ci(i-r6) 

Differentiating on both sides yields d — cid and hence d{t) — C2e^i*. Substituting in the above equation for d{t) we get 

m = '^e- + 1±^ 

ci Tci 

Solving for constants using the initial conditions 6(0) = 0, d(0) = do results in cl = —(1 + Tdo — T) =: —r and C2 = do- 
Hence we have, 

fe(t) ^ ^ _ ^e-^* 
r r 

d{t) = doe"''* 



Appendix H 
Convergence of h^^\k) to br, 



h'fHk) 



The solution of the ODE suggests that limj^oo b{t) = do/r. This is consistent with the fact that linit^oo —^ 
— ^ boo as we now proceed to show. 



i-(i-do)r 



h^f\k) = k 



1 + {N- k)j 



1 + {N -k-lYf 



h^f\k + l) = {k + l) 



i + {N -k-iy/ 



Thus we can write, 



/if)(fc) = fc 



l+-f{N -k) 



fc + 1 



Now substituting for k ~ doN and noting that F = •^N we have 



-^Vd^-'^^^'-'"^ Nido + ^) 
Taking N ^- oo and noting that h\ (k) is a continuous function, we have 

1 hi,''\Ndo) ^ ^ ^ r(l - do) h^f\Ndo) 



do N 



Take Hmits on both sides, and solving for the unknown, 

hif^\Ndo) 
iV ^ 



do 



1 - (1 - do)r 



N 



